The basic model. In a number of papers [1, 2, 4, 5, 6] nonGaussian
linear processes are considered as the basic model. Questions
relating to the estimation of coefficients and deconvolution
were dealt with. We give the assumptions here. Let $\{v_t\}$ be
a sequence of independent, identically distributed random
variables with $E v_t = 0$, $E v_t^2 = 1$ and some higher order
cumulant $\gamma_s \neq 0$ $(s > 2)$. The real coefficients
$\{\alpha_j\}$ are in $\ell^2$:

(1)  $\sum_j \alpha_j^2 < \infty$.

It's assumed that one observes the linear process

(2)  $X_t = \sum_j \alpha_j v_{t-j}$

without knowledge of the $\alpha_j$'s or the $v_t$'s. The object was to
determine procedures for the estimation of the coefficients
$\alpha_j$, whether or not the system is minimum phase, and to effect
deconvolution to estimate the $v_t$'s.

We shall still look at aspects of the problem just described.
But our main concern will be with the modified problem in
which we observe only

(3)  $Y_t = X_t + \eta_t$

where $\{X_t\}$ has the structure given in (2) and $\{\eta_t\}$ is a
Gaussian noise process independent of the process $\{X_t\}$.

In many cases the spectral ranges of the $X$ and $\eta$ processes
will be disjoint. In such a case direct linear filtering will
let us recover the process $\{X_t\}$. This happens, for example, when
the spectrum of the $\eta$'s is in the high frequency range and
that of the $X$ process in the low frequency range. Suppose
the interval $[-a, a]$ contains the $X$ spectral mass and that
of the $\eta$ process is outside. Further, let us assume that
$\pm a$ are continuity points of the spectral distribution
function $F$ of $Y$. The indicator function of $[-a, a]$ has the
Fourier representation

$\sum_j \frac{1}{\pi} \frac{\sin ja}{j} e^{-ij\lambda}$

(the term $j = 0$ being read as $a/\pi$) and this implies that

$X_t = \sum_j \frac{1}{\pi} \frac{\sin ja}{j} Y_{t-j}$.
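The filtering step can be illustrated numerically. The following is only a sketch under stated assumptions: the infinite sum is truncated to $|j| \le m$, and the test signal (two cosines), the cutoff and the function names `lowpass_weights` and `lowpass` are hypothetical choices that do not come from the paper.

```python
import math

def lowpass_weights(a, m):
    """Weights c_j = sin(j a) / (pi j) for j = -m..m, with c_0 = a / pi."""
    return {j: (a / math.pi if j == 0 else math.sin(j * a) / (math.pi * j))
            for j in range(-m, m + 1)}

def lowpass(y, a, m):
    """Truncated version of X_t = sum_j c_j Y_{t-j}.

    Output is returned only for those t where the whole window fits,
    i.e. for t = m, ..., len(y) - m - 1.
    """
    c = lowpass_weights(a, m)
    return [sum(c[j] * y[t - j] for j in range(-m, m + 1))
            for t in range(m, len(y) - m)]

# Toy signal: a low frequency component (inside [-a, a]) plus a
# high frequency component (outside [-a, a]).
n, a, m = 600, 1.0, 200
x = [math.cos(0.3 * t) for t in range(n)]
y = [x[t] + math.cos(2.5 * t) for t in range(n)]

x_hat = lowpass(y, a, m)
# Largest deviation between the filtered series and the low frequency part.
max_err = max(abs(x_hat[i] - x[i + m]) for i in range(len(x_hat)))
```

Because the sum is truncated, the recovery is only approximate; the error shrinks as `m` grows and as the frequencies move away from the cutoff $\pm a$.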

Dwyer [3] has been concerned with techniques aimed at gauging
the Gaussian or nonGaussian character of additive components
of the process $Y$ corresponding to different spectral
ranges. He tries to assess the third and fourth order moment
properties of such components.

We shall be interested in seeing what happens when the spectral
ranges of the $X$ and $\eta$ processes overlap. If the $\eta$
process is small compared to the $X$ process, one can still
try to deconvolve approximately, even by proceeding naively as
if $\eta$ weren't there. An example of such a naive deconvolution
is given below. All this is independent of whether the system
is minimum phase.




In Figures 1 and 2 the process $Y_t$ is generated by a Monte
Carlo simulation where

$X_t = v_t - 3.5\, v_{t-1} + 1.5\, v_{t-2}$

and $v_t$ is a sequence of exponential independent random
variables of variance one. Here the roots of the polynomial
$1 - 3.5z + 1.5z^2$ are 2 and 1/3. The additive noise $\eta_t$
consists of independent Gaussian variables of mean zero and
variance $\sigma^2 = 1$ in the case of Figure 1 and $\sigma^2 = 2$ for
Figure 2. The first line of the figure graphs the sequence
$Y_t$. The second line graphs the generating sequence $v_t$. The
third line gives the result of our naive deconvolution
neglecting $\eta$. The fourth line gives the error in our
deconvolution. The last line has the result of a minimum phase
deconvolution, which is naturally off because one of the roots
is inside the unit disc in the complex plane. Clearly our
naive deconvolution gets worse as $\sigma^2$ increases. But even
for $\sigma^2 = 2$, a moderate amount of noise, one can still
recognize certain broad features of the $v$ sequence. If $\eta$ is
large one won't be able to deconvolve, but one can still hope
to estimate a good deal about the $\alpha_j$'s under appropriate
conditions.
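A simulation of this kind can be sketched as follows. This is not the code behind Figures 1 and 2; it is a minimal illustration under stated assumptions: the coefficients 1, -3.5, 1.5 are treated as known, the exponential variables are centred so that they have mean zero, $v_t$ is taken as zero for $t < 0$, and the names `simulate`, `naive_deconvolve` and `rms_error` are hypothetical. The inverse filter is two sided, because $1 - 3.5z + 1.5z^2 = (1 - z/2)(1 - 3z)$ has one root outside and one root inside the unit disc.

```python
import random

def simulate(n, sigma2, seed=0):
    """Return (v, y) with y_t = x_t + eta_t and
    x_t = v_t - 3.5 v_{t-1} + 1.5 v_{t-2}  (v_t = 0 for t < 0)."""
    rng = random.Random(seed)
    # Exponential variables with variance one; centring is an assumption.
    v = [rng.expovariate(1.0) - 1.0 for _ in range(n)]
    vp = [0.0, 0.0] + v
    x = [vp[t + 2] - 3.5 * vp[t + 1] + 1.5 * vp[t] for t in range(n)]
    y = [xt + rng.gauss(0.0, sigma2 ** 0.5) for xt in x]
    return v, y

def naive_deconvolve(y):
    """Invert (1 - B/2)(1 - 3B) as if the noise were absent (B = backshift)."""
    n = len(y)
    # Factor (1 - 3B) has its root 1/3 inside the unit disc: invert it
    # backwards in time, w_{t-1} = (w_t - y_t) / 3, starting from w_{n-1} = 0.
    w = [0.0] * n
    for t in range(n - 1, 0, -1):
        w[t - 1] = (w[t] - y[t]) / 3.0
    # Factor (1 - B/2) has its root 2 outside the unit disc: invert it
    # forwards in time, v_t = w_t + v_{t-1} / 2.
    v_hat, prev = [0.0] * n, 0.0
    for t in range(n):
        prev = w[t] + 0.5 * prev
        v_hat[t] = prev
    return v_hat

def rms_error(v, v_hat, trim=50):
    """Root mean square error, ignoring the last `trim` points (end effects)."""
    d = [(p - q) ** 2 for p, q in zip(v[:-trim], v_hat[:-trim])]
    return (sum(d) / len(d)) ** 0.5

for s2 in (0.0, 1.0, 2.0):
    v, y = simulate(400, s2, seed=1)
    print(s2, rms_error(v, naive_deconvolve(y)))
```

Running the backward recursion for the factor with its root inside the unit disc keeps both recursions stable; a purely causal (minimum phase) inversion would not reproduce $v_t$ here.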


A simple model. To give some idea of the difficulties that
can arise, let us consider the case in which

(4)  $Y_t = a v_t + b u_t$

with the $\{v_t\}$ sequence nonGaussian independent, identically
distributed with

$E v_t = 0$,  $E v_t^2 = 1$,  $E v_t^3 = \gamma \neq 0$

and $\{u_t\}$ an independent, identically distributed sequence
of $N(0,1)$ random variables. The $\{v_t\}$ and $\{u_t\}$ sequences
are assumed to be independent. It is assumed that $|a|$, $|b|$
and $\gamma$ are unknown. The counterpart of the problem mentioned
above is that of estimating $|a|$ and $|b|$. $|a|$ and $|b|$
are not identifiable in terms of the problem as specified here
since $v_t$ might be decomposable

(5)  $v_t = v_t^{(1)} + v_t^{(2)}$

in terms of two nontrivial independent summands, one of which,
say $v_t^{(2)}$, is Gaussian. The problem can be normalized by
insisting that $v_t$ be indecomposable in the sense that a
representation of the form (5) be impossible. We shall call a
random variable $v_t$ with a nontrivial decomposition of the
form (5), in that a Gaussian summand with positive variance
exists, reducible. This could be expressed in terms of
distribution functions. A distribution will be called reducible
if it has a nontrivial Gaussian component.
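The lack of identifiability can be made explicit with a short calculation. The sketch below assumes a decomposition of the form (5) in which the Gaussian summand has variance $\tau^2$ with $0 < \tau^2 < 1$; the symbols $\tau$, $\tilde a$, $\tilde b$, $\tilde v_t$ and $\tilde u_t$ are introduced here for illustration only.

```latex
% Assume v_t = v_t^{(1)} + v_t^{(2)} with v_t^{(2)} ~ N(0, tau^2) independent
% of v_t^{(1)}, so that Var v_t^{(1)} = 1 - tau^2.
\begin{align*}
Y_t &= a v_t + b u_t
     = a v_t^{(1)} + \bigl(a v_t^{(2)} + b u_t\bigr), \\
\tilde v_t &= \frac{v_t^{(1)}}{\sqrt{1-\tau^2}}, \qquad
\tilde a = a\sqrt{1-\tau^2}, \qquad
\tilde b = \sqrt{b^2 + a^2\tau^2}, \qquad
\tilde u_t = \frac{a v_t^{(2)} + b u_t}{\tilde b} \sim N(0,1), \\
Y_t &= \tilde a\, \tilde v_t + \tilde b\, \tilde u_t ,
\qquad E\tilde v_t = 0,\quad E\tilde v_t^2 = 1,\quad
E\tilde v_t^3 = \frac{\gamma}{(1-\tau^2)^{3/2}} \neq 0 .
\end{align*}
```

Both $(|a|, |b|)$ and $(|\tilde a|, |\tilde b|)$ then describe the same observed sequence in the form (4), which is why a normalization such as irreducibility of $v_t$ is needed.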


PROPOSITION. A reducible distribution has a maximal
decomposition relative to its Gaussian component.

Let $\varphi$ be the characteristic function of the reducible
distribution. Then there are constants $c_n > 0$ and
corresponding characteristic functions $\psi_n$ such that

$\varphi(t) = \psi_n(t) \exp\left(-c_n \frac{t^2}{2}\right)$.

Then

$\mathrm{Re}\, \varphi(t) \le \mathrm{Re}\, \psi_n(t) \le 1$.

Let the $F_n$ be the distributions corresponding to the
characteristic functions $\psi_n$. We have

$\int_{|x| > 1/u} dF_n(x) \le \frac{7}{u} \int_0^u \{1 - \mathrm{Re}\, \psi_n(v)\}\, dv \le \frac{7}{u} \int_0^u \{1 - \mathrm{Re}\, \varphi(v)\}\, dv$.

We have then uniform bounds on the tails of the distributions
$F_n$. One can therefore choose a subsequence $F_{n_k}$ with
$c_{n_k} \uparrow c = \sup_n c_n$ that converges weakly. Let the
limiting distribution be $F$ with corresponding characteristic
function $\psi$. It then follows that

(6)  $\varphi(t) = \psi(t) \exp\left(-c \frac{t^2}{2}\right)$

which corresponds to the unique maximal decomposition.


The problem of estimating $|a|$, $|b|$ now in (4), where $v_t$ is
irreducible, is one in which $|a|$, $|b|$ are identifiable.
However, the problem of deconvolution, which is that of
estimating $v_t$, is clearly not meaningful.


The representation (6) implies that any distribution with a
characteristic function $\varphi(t)$ that doesn't decrease to zero
as fast as the Gaussian as $|t| \to \infty$ must be irreducible.
This means that all discrete distributions and all gamma
distributions must be irreducible.
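For the gamma case the argument can be written out in one line. The following sketch uses the standard characteristic function of a gamma distribution with shape $k > 0$ and rate $\beta > 0$; the symbols $k$ and $\beta$ are notation introduced here.

```latex
% Gamma(k, beta): phi(t) = (1 - i t / beta)^{-k}.
\begin{align*}
|\varphi(t)| &= \Bigl(1 + \frac{t^2}{\beta^2}\Bigr)^{-k/2}
  \quad\text{(polynomial decay as } |t| \to \infty\text{)}, \\
\varphi(t) = \psi(t)\, e^{-c t^2/2},\ c > 0
  &\;\Longrightarrow\;
  |\psi(t)| = \Bigl(1 + \frac{t^2}{\beta^2}\Bigr)^{-k/2} e^{c t^2/2}
  \longrightarrow \infty ,
\end{align*}
% which contradicts |psi(t)| <= 1 for a characteristic function, so no
% Gaussian component with positive variance can be split off.
```

The same contradiction arises for a discrete distribution, whose characteristic function is almost periodic and so satisfies $\limsup_{|t| \to \infty} |\varphi(t)| = 1$.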


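Let's consider the question of predicting $a v_t$ given $Y_t$.
We shall put this in the form

$Y = \xi + \eta$

where $\xi$ and $\eta$ are independent with means zero and variances
$\sigma_\xi^2$ and $\sigma^2$. Here $\eta$ is Gaussian and $g$ denotes the
density of $\xi$. Then

$E(\xi^2 | Y) = Y^2 + 2Y\sigma^2\, \frac{g'(Y) + \frac{\sigma^2}{2} g^{(3)}(Y) + \cdots}{g(Y) + \frac{\sigma^2}{2} g''(Y) + \cdots} + \sigma^2\, \frac{g(Y) + \frac{3}{2}\sigma^2 g''(Y) + \cdots}{g(Y) + \frac{\sigma^2}{2} g''(Y) + \cdots}$.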
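The conditional moments behind expansions of this type can be sketched as follows. This is a formal derivation under assumptions that are not spelled out in the surviving text: $\eta$ is $N(0, \sigma^2)$ with density $\phi_\sigma$, $\xi$ has a smooth density $g$, and $f_Y$ denotes the density of $Y = \xi + \eta$ (the symbols $\phi_\sigma$ and $f_Y$ are introduced here).

```latex
\begin{align*}
f_Y(y) &= \int g(y-s)\,\phi_\sigma(s)\,ds
        = g(y) + \frac{\sigma^2}{2}\, g''(y) + \cdots , \\
E(\eta \mid Y=y)\, f_Y(y) &= \int s\, g(y-s)\,\phi_\sigma(s)\,ds
        = -\sigma^2 \int g(y-s)\,\phi_\sigma'(s)\,ds
        = -\sigma^2 f_Y'(y) , \\
E(\xi \mid Y) &= Y - E(\eta \mid Y)
        = Y + \sigma^2\,
          \frac{g'(Y) + \frac{\sigma^2}{2} g^{(3)}(Y) + \cdots}
               {g(Y) + \frac{\sigma^2}{2} g''(Y) + \cdots} , \\
E(\eta^2 \mid Y) &= \sigma^2 + \sigma^4\, \frac{f_Y''(Y)}{f_Y(Y)} , \qquad
E(\xi^2 \mid Y) = Y^2 - 2Y\,E(\eta \mid Y) + E(\eta^2 \mid Y) .
\end{align*}
```

The second line uses $s\,\phi_\sigma(s) = -\sigma^2 \phi_\sigma'(s)$ followed by an integration by parts; the series in $\sigma^2$ come from a Taylor expansion of $g(y - s)$ and are formal.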


We should like to compare

$E((\xi - aY)^2 | Y) = E(\xi^2 | Y) - 2aY\, E(\xi | Y) + a^2 Y^2$

with

$E((\xi - E(\xi | Y))^2 | Y) = E(\xi^2 | Y) - E(\xi | Y)^2$.

It is clear that

$E((\xi - E(\xi | Y))^2 | Y) \le E((\xi - aY)^2 | Y)$

and the difference is given by

$(aY - E(\xi | Y))^2 = \left((1-a)Y + \sigma^2 \frac{g'(Y)}{g(Y)} + o(\sigma^2)\right)^2$.

Such a comparison can give us some idea of the effectiveness
of the best predictor versus the linear predictor in the tail
region of the $g$ distribution. This tail region may be of
greatest interest in certain deconvolution problems (see
Wiggins [6]).
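A case where the best predictor is available in closed form may help fix ideas. The sketch below is a hypothetical example, not taken from the paper: $\xi = \pm 1$ with probability 1/2 each (a discrete, hence irreducible, distribution with variance one) and $\eta$ is $N(0, \sigma^2)$. Bayes' rule then gives $E(\xi | Y = y) = \tanh(y/\sigma^2)$, and the linear mean square predictor is $aY$ with $a = 1/(1 + \sigma^2)$.

```python
import math

def best_predictor(y, sigma2):
    """E(xi | Y = y) for xi = +-1 with probability 1/2, eta ~ N(0, sigma2)."""
    return math.tanh(y / sigma2)

def posterior_mean_direct(y, sigma2):
    """The same conditional mean computed directly from Bayes' rule."""
    w_plus = math.exp(-(y - 1.0) ** 2 / (2.0 * sigma2))
    w_minus = math.exp(-(y + 1.0) ** 2 / (2.0 * sigma2))
    return (w_plus - w_minus) / (w_plus + w_minus)

def excess_of_linear(y, sigma2):
    """(aY - E(xi|Y))^2, the extra conditional mean square error of the
    linear predictor aY, with a = Var(xi) / (Var(xi) + sigma2)."""
    a = 1.0 / (1.0 + sigma2)
    return (a * y - best_predictor(y, sigma2)) ** 2

for y in (0.0, 0.5, 1.0, 3.0, 6.0):
    print(y, best_predictor(y, 1.0), excess_of_linear(y, 1.0))
```

In this example the best predictor saturates at $\pm 1$ while the linear predictor keeps growing, so the excess error of the linear predictor grows in the tails of $Y$.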

Up to this point we have considered prediction in mean square.
Suppose we consider instead trying to minimize

(7)  $E(\xi - f(Y))^4$

for some appropriate $f$ when $Y = \xi + \eta$ with $\eta$ $N(0, \sigma^2)$ and
$\xi$ irreducible nonnormal with mean zero and variance $\sigma_\xi^2$. An
analysis can be carried out by considering

$E(\xi - f(Y))^4 = E(E((\xi - f(Y))^4 | Y))$.

We can obviously minimize this if we can minimize the
conditional fourth moment on the right side for each $Y$. This
suggests that we consider minimizing

$E(Z - c)^4$

in $c$, where $Z$ is a random variable with $E Z^4 < \infty$ and $c$
is a constant. Let $m$ be the mean of $Z$ with $a = m - c$. Then

$E(Z - c)^4 = E(Z - m + a)^4 = \mu_4 + 6\sigma^2 a^2 + 4\mu_3 a + a^4$

with $\sigma^2$ the variance of $Z$ and $\mu_j$ the $j$th central moment.

Differentiating this with respect to $a$ we get

(8)  $4a^3 + 12\sigma^2 a + 4\mu_3$.

If $\mu_3 = 0$ the unique real zero is $a = 0$ and the minimum
is attained by $c = m$. If $\mu_3 \neq 0$ we have a unique real zero
since the derivative

$12a^2 + 12\sigma^2$

of (8) is positive. The zero will be negative if $\mu_3 > 0$
and positive if $\mu_3 < 0$. Set (8) equal to zero and solve for
$a$. By Cardano's formula one obtains

(9)  $a = 2^{-1/3} \left[-\mu_3 + (\mu_3^2 + 4\sigma^6)^{1/2}\right]^{1/3} - 2^{-1/3} \left[\mu_3 + (\mu_3^2 + 4\sigma^6)^{1/2}\right]^{1/3}$.

This means that the function $f(Y)$ minimizing (7) is

$f(y) = m(y) - a(y)$

where

$m(y) = E(\xi | y)$

and $a(y)$ is given by expression (9) with

$\mu_3 = E((\xi - E(\xi | Y))^3 | Y)$,

$\sigma^2 = E((\xi - E(\xi | Y))^2 | Y)$.
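Expression (9) is easy to check numerically. The sketch below evaluates the Cardano root of $a^3 + 3\sigma^2 a + \mu_3 = 0$ and uses it to minimize an empirical fourth moment; the function names and the small data set in the usage lines are hypothetical.

```python
import math

def cbrt(x):
    """Real cube root, valid for negative arguments as well."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)

def shift_a(mu3, sigma2):
    """Real zero of 4 a^3 + 12 sigma2 a + 4 mu3, following expression (9)."""
    root = math.sqrt(mu3 ** 2 + 4.0 * sigma2 ** 3)
    return 2.0 ** (-1.0 / 3.0) * (cbrt(-mu3 + root) - cbrt(mu3 + root))

def fourth_moment_minimizer(values):
    """Constant c minimizing the average of (z - c)^4 over the given values."""
    n = len(values)
    m = sum(values) / n
    sigma2 = sum((z - m) ** 2 for z in values) / n    # central moment of order 2
    mu3 = sum((z - m) ** 3 for z in values) / n       # central moment of order 3
    return m - shift_a(mu3, sigma2)                   # c = m - a

z = [0.0, 0.0, 0.0, 1.0]            # right skewed sample, mu3 > 0
c = fourth_moment_minimizer(z)
# First order condition: the average of (z - c)^3 should vanish.
residual = sum((v - c) ** 3 for v in z) / len(z)
print(c, residual)
```

For a sample with positive third central moment the root `a` is negative, so the minimizing constant `c = m - a` lies above the mean, in line with the sign statement following (8).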


Estimation of coefficients. Let $\{Y_t\}$ be a process of the type
indicated in formula (3), that is, the sum of a linear
nonGaussian process and an independent Gaussian noise process.
We shall indicate in a simple way that the coefficients $\alpha_j$
of the linear nonGaussian process can be estimated up to an
undetermined multiplier and an undetermined time shift. This
is an asymptotic argument.

PROPOSITION. Let $\{Y_t\}$ be a linear nonGaussian process
perturbed by independent Gaussian noise. Suppose that
$\sum_j |j|\,|\alpha_j| < \infty$, $\alpha(e^{-i\lambda}) \neq 0$ for all $\lambda$ and
$\gamma_3 \neq 0$. Then the coefficients $\alpha_j$ can be consistently
estimated up to an undetermined multiplier $c$ and an
unspecified time shift of the index set.


The bispectral density of the process $\{Y_t\}$ is

$b(\lambda_1, \lambda_2) = \frac{\gamma_3}{(2\pi)^2}\, \alpha(e^{-i\lambda_1})\, \alpha(e^{-i\lambda_2})\, \alpha(e^{i(\lambda_1 + \lambda_2)})$.

As before one can estimate the phase of $\alpha(e^{-i\lambda})$ consistently
up to an undetermined additive term $k\lambda$ with $k$ integral.
This can be accomplished by using bispectral estimates as in
Lii and Rosenblatt [4]. Notice that

(10)  $b(\lambda, 0) = \frac{\gamma_3}{(2\pi)^2}\, \alpha(1)\, |\alpha(e^{-i\lambda})|^2$.

Bispectral estimates (using (10)) then allow us to estimate
the spectral density of the process $\{X_t\}$ up to an
undetermined constant. The proposition follows immediately from
these remarks.
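The reason the Gaussian noise does not enter the bispectral density can be sketched in a few lines. The derivation below is formal and uses notation introduced here: $c_Y(j,k)$ is the third order cumulant of $Y_t, Y_{t+j}, Y_{t+k}$, and $\alpha(z) = \sum_s \alpha_s z^s$.

```latex
\begin{align*}
c_Y(j,k) &= c_X(j,k) + c_\eta(j,k) = c_X(j,k)
  && \text{(independence; Gaussian third cumulants vanish)} \\
c_X(j,k) &= \gamma_3 \sum_s \alpha_s\, \alpha_{s+j}\, \alpha_{s+k} \\
b(\lambda_1,\lambda_2)
  &= \frac{1}{(2\pi)^2} \sum_{j,k} c_Y(j,k)\, e^{-i(j\lambda_1 + k\lambda_2)}
   = \frac{\gamma_3}{(2\pi)^2}\,
     \alpha(e^{-i\lambda_1})\,\alpha(e^{-i\lambda_2})\,
     \alpha(e^{i(\lambda_1+\lambda_2)}) \\
b(\lambda,0)
  &= \frac{\gamma_3}{(2\pi)^2}\,\alpha(1)\,\alpha(e^{-i\lambda})\,\alpha(e^{i\lambda})
   = \frac{\gamma_3}{(2\pi)^2}\,\alpha(1)\,|\alpha(e^{-i\lambda})|^2
  && \text{(real coefficients)}
\end{align*}
```

The third line follows from the substitution $u = s + j$, $w = s + k$ in the triple sum.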

Corollary. Let the assumptions of the proposition be satisfied.
Suppose that the Gaussian noise is white and that
$|\alpha(e^{-i\lambda})|$ is not constant in $\lambda$. One can then estimate the
multiplier $c$. There is then only an unspecified time shift
of the index set. The variance of $\eta$ can also be estimated.

Since the assumptions of the proposition are satisfied the
conclusions hold. We have only to estimate the multiplier $c$.
Both the spectral density

$f_Y(\lambda) = \frac{1}{2\pi}\, |\alpha(e^{-i\lambda})|^2 + \frac{\sigma^2}{2\pi}$

of the $Y$ process, with $\sigma^2$ the variance of the white noise
$\eta$, and

$b(\lambda, 0) = \frac{\gamma_3}{(2\pi)^2}\, \alpha(1)\, |\alpha(e^{-i\lambda})|^2$

can be estimated consistently. Clearly

$f_Y(\lambda) = \frac{2\pi}{\gamma_3\, \alpha(1)}\, b(\lambda, 0) + \frac{\sigma^2}{2\pi} = A\, b(\lambda, 0) + B$.

The coefficients $A$, $B$ can be consistently estimated given
consistent estimates of $f_Y(\lambda)$, $b(\lambda, 0)$. The conclusion then
follows.
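The linear relation $f_Y = A\, b(\cdot, 0) + B$ suggests a simple regression. The sketch below is illustrative only: it uses exact spectra on a frequency grid as stand-ins for consistent estimates, ordinary least squares as one possible way to fit $A$ and $B$, and hypothetical parameter values ($\alpha = (1, -3.5, 1.5)$, $\gamma_3 = 2$, $\sigma^2 = 1$).

```python
import cmath
import math

def transfer(alpha, lam):
    """alpha(e^{-i lam}) for coefficients alpha_0, alpha_1, ..."""
    return sum(a * cmath.exp(-1j * lam * j) for j, a in enumerate(alpha))

def fit_line(b_vals, f_vals):
    """Ordinary least squares fit of f = A b + B; returns (A, B)."""
    n = len(b_vals)
    mb = sum(b_vals) / n
    mf = sum(f_vals) / n
    sbb = sum((b - mb) ** 2 for b in b_vals)
    sbf = sum((b - mb) * (f - mf) for b, f in zip(b_vals, f_vals))
    A = sbf / sbb
    return A, mf - A * mb

# Hypothetical parameters; exact spectra replace estimates in this sketch.
alpha, gamma3, sigma2 = [1.0, -3.5, 1.5], 2.0, 1.0
grid = [math.pi * k / 50 for k in range(51)]
gain2 = [abs(transfer(alpha, lam)) ** 2 for lam in grid]
alpha_at_1 = sum(alpha)                                    # alpha(1)
b_vals = [gamma3 * alpha_at_1 * g / (2 * math.pi) ** 2 for g in gain2]
f_vals = [(g + sigma2) / (2 * math.pi) for g in gain2]

A, B = fit_line(b_vals, f_vals)
sigma2_hat = 2 * math.pi * B          # noise variance recovered from B
print(A, B, sigma2_hat)
```

The fit is only possible because $|\alpha(e^{-i\lambda})|$ varies with $\lambda$; if it were constant, `b_vals` would be constant and the slope could not be separated from the intercept.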


The argument of a polynomial transfer function. In many cases
it seems reasonable to assume that $\alpha(z)$ is a polynomial

$\alpha(z) = \sum_{j=0}^{p} \alpha_j z^j$.

Because the coefficients are assumed real, the roots $z_j$ are
real or occur in complex conjugate pairs. The argument of
$\alpha(e^{-i\lambda})$ is the sum of the contributions from the factors
corresponding to the roots. We shall consider the contribution
from a root $z_j$ with $|z_j| \neq 1$. Suppose $z_j = r e^{-i\theta}$ with
$0 < r < 1$ and $-\pi < \theta \le \pi$. Then

$\log(e^{-i\lambda} - z_j) = -i\lambda + \log(1 - e^{i\lambda} z_j) = -i\lambda - \sum_{k=1}^{\infty} (e^{i\lambda} z_j)^k / k$

and this implies that

(11)  $\mathrm{Im} \log(e^{-i\lambda} - z_j) = -\lambda - \sum_{k=1}^{\infty} \frac{r^k}{k} \sin k(\lambda - \theta)$

for $|z_j| < 1$.

Similarly for $z_j = r e^{-i\theta}$ with $r > 1$ one has

$\log(e^{-i\lambda} - z_j) = \log(-z_j) + \log(1 - z_j^{-1} e^{-i\lambda}) = i\pi + \log z_j - \sum_{k=1}^{\infty} (z_j^{-1} e^{-i\lambda})^k / k$

and this implies that we have a representation

(12)  $\mathrm{Im} \log(e^{-i\lambda} - z_j) = \pi - \theta - \sum_{k=1}^{\infty} \frac{r^{-k}}{k} \sin k(\theta - \lambda)$

for $|z_j| > 1$. Let us formally consider the difference between
(11) and (12) for $r = 1$, that is

(13)  $\theta - \lambda - \pi + 2 \sum_{k=1}^{\infty} \frac{\sin k(\theta - \lambda)}{k}$.

Notice that the Fourier series of $x - \pi\, \mathrm{sgn}\, x$ is

$-2 \sum_{k=1}^{\infty} \frac{\sin kx}{k}$.

This implies that the expression (13) equals zero for
$0 < \theta - \lambda < \pi$ and $-2\pi$ for $-\pi < \theta - \lambda < 0$. This is consistent
with the indeterminacy of the argument up to a multiple of $2\pi$.
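The series (11) and (12) can be checked against a direct evaluation of the argument. The sketch below truncates each series at a fixed number of terms (an assumption made for the illustration) and compares the result, modulo $2\pi$, with the phase of $e^{-i\lambda} - z_j$ computed by the standard library; all function names are hypothetical.

```python
import cmath
import math

def arg_inside(lam, r, theta, terms=200):
    """Right side of (11) for a root r e^{-i theta} with 0 < r < 1."""
    return -lam - sum(r ** k * math.sin(k * (lam - theta)) / k
                      for k in range(1, terms + 1))

def arg_outside(lam, r, theta, terms=200):
    """Right side of (12) for a root r e^{-i theta} with r > 1."""
    return math.pi - theta - sum(r ** (-k) * math.sin(k * (theta - lam)) / k
                                 for k in range(1, terms + 1))

def direct_arg(lam, r, theta):
    """Principal value of arg(e^{-i lam} - r e^{-i theta})."""
    return cmath.phase(cmath.exp(-1j * lam) - r * cmath.exp(-1j * theta))

def gap_mod_2pi(x, y):
    """Distance between x and y when both are read modulo 2 pi."""
    d = (x - y) % (2 * math.pi)
    return min(d, 2 * math.pi - d)

lam, theta = 1.1, 0.7
print(gap_mod_2pi(arg_inside(lam, 0.5, theta), direct_arg(lam, 0.5, theta)))
print(gap_mod_2pi(arg_outside(lam, 2.0, theta), direct_arg(lam, 2.0, theta)))
```

The comparison is made modulo $2\pi$ because the series select a particular branch of the logarithm, while `cmath.phase` returns the principal value.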
For a real root $|z_j| < 1$ expression (11) becomes

$-\lambda - \sum_{k=1}^{\infty} (\pm 1)^k\, \frac{r^k}{k} \sin k\lambda$

according as to whether $\theta = 0$ or $\pi$. In the case of complex
conjugate roots of absolute value less than one the sum
of the contributions is

$-2\lambda - 2 \sum_{k=1}^{\infty} \frac{r^k}{k} \sin k\lambda \cos k\theta$.

The corresponding remarks for roots of absolute value greater
than one read as follows. If $|z_j| > 1$ is real expression (12) is

$\pi - \theta + \sum_{k=1}^{\infty} (\pm 1)^k\, \frac{r^{-k}}{k} \sin k\lambda$

with $\pm 1$ according as to whether $\theta = 0$ or $\pi$. If there are
complex conjugate roots one obtains

$2\pi + 2 \sum_{k=1}^{\infty} \frac{r^{-k}}{k} \sin k\lambda \cos k\theta$.


In our case we try to estimate the sum of the contributions
to the argument from all zeros and then, as a convention,
readjust the value at zero of the sum so that it is zero there.

This discussion of the relationship between the argument of
the transfer function and zeros of $\alpha(z)$ is natural when
$\alpha(z)$ is a polynomial. A corresponding discussion could be
carried out if $\alpha(z)$ is a rational function in terms of the
zeros and poles of $\alpha(z)$. However, one can easily give
examples of analytic functions $\alpha(z)$ with no zeros such as,
for example,

$\alpha(z) = \exp(\sin z)$.

Here, there would be no meaning to such an analysis.

